Segmentation of patent claims for improving their readability
نویسندگان
چکیده
Good readability of text is important to ensure efficiency in communication and eliminate risks of misunderstanding. Patent claims are an example of text whose readability is often poor. In this paper, we aim to improve claim readability by a clearer presentation of its content. Our approach consist in segmenting the original claim content at two levels. First, an entire claim is segmented to the components of preamble, transitional phrase and body, using a rule-based approach. Second, a conditional random field is trained to segment the components into clauses. An alternative approach would have been to modify the claim content which is, however, prone to also changing the meaning of this legal text. For both segmentation levels, we report results from statistical evaluation of segmentation performance. In addition, a qualitative error analysis was performed to understand the problems underlying the clause segmentation task. Our accuracy in detecting the beginning and end of preamble text is 1.00 and 0.97, respectively. For the transitional phase, these numbers are 0.94 and 1.00 and for the body text, 1.00 and 1.00. Our precision and recall in the clause segmentation are 0.77 and 0.76, respectively. The results give evidence for the feasibility of automated claim and clause segmentation, which may help not only inventors, researchers, and other laypeople to understand patents but also patent experts to avoid future legal cost due to litigations.
منابع مشابه
Visualization of patent claims structure to improve their readability
Readability is considered as a important element in effective communication. The patent claims are regarded as typical bad example of readability since they are written in long sentences and the structure is complex among different claims. In one patent document, the several claims contain hierarchical structure within each other. In this project, we want to improve the patent claims readabilit...
متن کاملAligning Patent Claims with Detailed Descriptions for Readability
Patent specifications consist of patent claims and detailed descriptions. While patent claims are the most important part of patent specifications, they are compositionally or combinationally described and difficult to read. By aligning patent claims with detailed description, the readability of patent claims can be improved because paraphrases for the claims can be found. In this paper, we pro...
متن کاملPatent Claim Processing For Readability - Structure Analysis And Term Explanation
Patent corpus processing should be centered around patent claim processing because claims are the most important part in patent specifications. It is common that claims written in Japanese are described in one sentence with peculiar style and wording and are difficult to understand for ordinary people. The peculiarity is caused by structural complexity of the sentences and many difficult terms ...
متن کاملNatural Language Analysis Of Patent Claims
We propose a NLP methodology for analyzing patent claims that combines symbolic grammar formalisms with dataintensive methods while enhancing analysis robustness. The output of our analyzer is a shallow interlingual representation that captures both the structure and content of a claim text. The methodology can be used in any patent-related application, such as machine translation, improving re...
متن کاملنقش ادعاهای اختراع در تعیین عرصه فنی حمایت شده توسط حق اختراع؛ «کنکاشی در زیست فناوری»
Patent claims determining the legal protection extent of an invention play an essential role in patent rights. Type and drafting of patent claims constitute indeed the important elements for determining legal protection extent of an invention on the one hand and enforcing patenee's exclusive right against competitors on the other hand. While different drafting or interpreting methods of pate...
متن کامل